1/9 




FIG. 1 



102-\ 

COMBINE & CATENATE 
ALL COMBINATIONS OF 
OVERLAPPING ALIGNMENTS 
THAT AGREE WITH EACH OTHER 

104 -\ I 
EXTEND THE BOUNDRIES OF 
OVERLAPPING ALIGNMENTS THAT 
AGREE WITH THEIR FIRST 
& LAST EXONS 
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Input: 
A 
B 
C 
D 
E 

After Step 1: 
A-B 
A-C 

E 



40 ~\ 

"Normal" gene splicing [ 

41-^ 

Cassette Exon [ 

Shifted splice site [ 

Cassette Exon w/ shift C 
44^ 

Variable exons 
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After Combining: 



A [ 
A-B C 
A-C [ 
A-B-C : 
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After Combining 
(3 examples): 
A-B [= 
B-C [= 
A-B-Cl= 
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200^ 

READ INPUT GFF FILE 
202^ 1 

POPULATE AN ARRAY 
OF DATA STRUCTURES 



204^ | 



STORE EACH ALIGNMENT AS 
SET OF ALIGNMENT BLOCKS 


206^ \ 


EACH BLOCK REPRESENTS 
A MATCHING REGION 




208^ | 




AS ALIGMENTS ARE READ 








DETERMINE IF GAPS ARE 
INTRONS OR INSERTS 







ELIMINATE SINGLE 
EXON ALIGNMENTS 




214-\ I 




TRIM SOFT ENDS 
OF ALIGNMENTS 




216^ | 


FILTER OUT HIGHLY 
SIMILAR ALIGNMENTS 



FIG. 7 



5/9 




After Filtering: 
A (+B) m 




After Merging: 



2 C 
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Input: 



B O- 
C 




Results if alignment B 
is discarded early: 




Results if alignment B 
is discarded later: 



A 
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400- 



PERFORM PAIRWISE 
COMPARISON OF 
ALL ALIGNMENTS 



402 



FOR EACH PAIR CONSIDER 
LEFT-MOST & RIGHT-MOST 
OVERLAPPING EXONS 



406- 



tONFILCTS^ 
IN HARD 
V EDGE?^ 

Iyes 



DO NOT USE 
COMBINATION- 
PLACE ON DONE LIST 



410 



PURGE REDUNDANT 
ALIGNMANTS 



v NO 



r-412 



EXTEND SHORTER ENDED 
ALIGNMENTS TO MATCH 
LONGER ALIGNMENTS 



414 



CREATE NEW 
ALIGNMENT 
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